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i " Strategies for Algebra Problems 

Abstract - j 
\r) Exper iment . 1% 42 subjects solved algebra word problems in/STtRer work or 

equation format.. In Experiment 2, kl subjects , solved or si triply translated word 

f ' ■ * • , 

problems Into equations. The pattern of response latencies, by problem length, 
was different for the treatment groups. Results conflicted with a two-stage 
modeTof problem solving consisting of separate translation and solution 



processes. 
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1 This paper Is concerned with the development of a^ theory of algebraic 

j- . .\ . 

problem'solvingv The rationale for stucjyfcng algebraic problem solving includes 
the fact that" work in this area would contribute to existing vyork on prose 
/comprehension (Hinsley, Hayes 5 Slmdrf7|l^77) ^ that a theory of algebraic 

problem soljvljig would become. a major component 6f a larger theory of deductive* 
reasoning (Mayer S Revl ia, 1 978Ji T and that the resulting theory could be 
tested in-*a wide variety .of real world situations involving mathematics in- 



structioti (>Ki 1 pa trick** 1970) 

V 

(1) Equation Problepij *- consisting of single or multiple eqj/e 



Traditionally, studies_ofj algebraic problem sdlvjng have dealt v^ith one of* 
three tasks ^ y 



at ions 

and a goal of solving for a certain value.- For 1 example, yk ^9. 
(2) Word Problems — consisting of a* si ngle or .multiple sentences that - 
• can be translated into equations and solved without any additional 
' semantic knowledge. ,For example, find a number such that 3 times v 
the number is the same as 9v 



(3) Story Problems — consisting of single or multiple sentences that 
require additional semantic knowledge to be translated into y 
equations and solved* For example, John drove his jet 9 mi tes 
on 3 gallons of gas. What was his mileage? (You must know 
that mileage = distance t gas .used.) 
The domain of the present study will be restricted to single sentence word 
problems, although these tasks are closely related to the others. 
Translation and Solution 

An extensive literature review is beyond the scope of this paper. How- 
ever, much prior work falls into two categories: computer simulations of 
algebraic problem solving processes (e.g., Eobrow, 1-968; Hayes & Simon, 197**; 
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Bbndy, Note 1,; Bregar £ Farley, Note 2; Novak, Note 3); and instructional 
studies aimed. at Improving mathematics education, su£h as published In Journal 
for Research In Mathematics Education or NCTM meeting proceedings (Barnett, 
Vds £ ^owdder, Note k\ Clement, Lochhead £ Soloman, Note 5). Unfortunately, 
there has' not been much basic experimental research to-fill the -gap between 
'high-level theory (i.e. computer models) and educational application. 

Perhaps the strongest single theme of the existing experimental research 
an-^aigebraic problem solving is that the process of translating the problen) 
into a representation is a crucial one. For example/ Paige £ Simon (1966) 
found that subjects who drew integrated pictures to represent word problems 
were more likely to perform correctly on the problems^than subjects who drew 
fragmented pictures. Hayes and his collegues (Hinsley, Hayes £ Simon, 1977 
Robinson # £ Hayes, 1978) found that students have "schemas" for various types of 
story problems and that they use these schemas in selection of Information they 
will pay attention to. Work on arithmetic word problems by Greeno andhis 
associates (Ri1ey£ Greeno, Note 6; Heller £ Greeno, Note .7) demonstrates that 
children can interpret an add/subtract^operat'ion in a word problem in different 
ways such- as "cause/change" or "combine" or "compare," and that the difficulty 
of the problem depends on which interpretation is involved. In similar studies, 
Loftus £'Suppes (1972) and Rosenthal £ Resnick (197*0 found that difficulty of 
word problems was affected^ by the linguistic structure of the problem. Schwartz 
(1971) found that subjects who used a matrix representat ion for information 
in a. deduction problem were more successful. Also, Mayer £ Greeno ^1975) and 
Mayer (1978a; 1978b) reported that the same' algebra ic information was stored in 
memory in qual i tatively different ways ^by different subjects^ Insreview of 
representative studies in human jdeductive reasoning, Mayer £ Revl in (1978 
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p. 21) concluded: , 1 - 
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* The most s-trlking commonality among the studies Is that 

> 

emphasis on how. reasoner? encode th^ presented Information... 
each researcher seems, to fbcus on a slightly different aspect 
• of «t:he encoding process. t v' 

This conclusion seems particular^ relevant to work on algebraic, problem solving 
A^second strong theme of experimental work involves mainly the, process of 
solution, (rather th^n translation), A detailed analysis of the solution 
prdtod|pls of I fid i vidua I >prob\em ,sol vers in vol ved in solving physics problems 
suggests that students acquire a set of heuristic strategies that can be 

represented formally (Simoh, t Simon, 1978; La^kin, Note 8). Similarly, Greeno 

' A * \, ^ ♦ ' * 

(1976, 1979) $ias provided a detailed analysis of the problem solving processes » 

* ■* . 

used by high school s : tude?nts solving geometry problems. These studies may be 
summarized by statingHhat detailed models of the problem solving process for 
story problem^ can be»fit to ^the jdata ,of real students solving real science" 

\ - • •' t ' ■ ' * ' 

problems; furthermore,, a major feature of tHe solution process for skilled per- 
formers is a reliance ^—^on various forms of heuristic planning. 

' v - 

The research on encoding % a^^^so lotion processed, as well as one's own 
introspections, might lead^one^o^a^^ime that solving word problems consists of 
two successive stages: translation from words to equation, and solution of the 
equation using the rules of felgebr^^nd the strategic planning of a^good 
problem solver; Indeed, many algebra textbooks explicitly teach this ^procedure 
of translatiog followed by &bTut ton*- -Further , computer models of algebraic 
problem solving generally make a distinction between a program. to translate a 
.problem from tngl ?sh into an c interniiT^representation, and a program to solve 
the problem by applying operations 'tp-Tthe internal representation. For ex- 
ample, Bobrowjs (1968) "STUDENT and H^s & Simon's (1977) UNDERSTAND program 
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make , a clear distinction between translotlon and solution and tenjj to emphasize^ 

the first process; In addl t Ion ,f&undy (Note 1) and Dergep & farley (Note 2) 

rely on the trans iatlon/.sol ut Ion distinction but tend t4 emphasize the solution 

/ ' 

process In their programs. ^ s i . 

For the purposes of the present paper, tjbe term "two-stage model" will 
refer to the Idea that solving word problems Involves: 

(1) Trans lat ion Stage which takes words as Its input tfnd gives a , 
formal representation (such as an equation) as it's output, using # 
encpd jng rules . * . 

(2) Sol ut ion Stage which takes the output of the translation stage 
as its Input and gives a numerical value as its output, using 
algebraic rules and strategic rules . 

(Although the two-^stage model, seems to be a wel 1 -accepted fact Of life — 
forming the basis for computer models as well as textbook lessons *- there has 

not been sufficient research fevidence to establish its validity. 

s . 

Problem Representation 

* , • . . 

This paper provides £n experimental investigation of the two-stage model 

in the context of one type of word problem. In particular, this study in- 

vestigates whether subjects use different solution procedures when an algebra 

problem is presented in equation format versus when It is presented as a word 

problem. The effects of problem representation on algebraic problem solving 

have been investigated in several previous studies. 

In a previous set of experiments on algebraric substitution, subjects 

learned a set of four interlocking equations (Mayer," & Greeno, 1975, Experiment 

1). For some subjects the information was presented as a $tory such as, 

Mileage— Distance/Gas Used, Total Time = Preparation Time + Driving Time, 

Driving Time = Arrival Time - Leaving Time, Speed - Distance/Driving Time. 



o 
O 
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Fof other subjects, the Infocnjatlon was presented as an Isomorphic set of 

I ' 
nonpenSp equations, such as M » D/G, T « P + V, V « A - L, S « 6VV. The re- 

i 

suits Indicated that there were no major differences between groups -In terms of 
response latencies 6r reported solution strategy for simple problems such as, 
Preparation Time ° 1/2 hour, total Time « 2 hours., Find Driving Time". Thus 
If we had stopped at that point we would have concluded that the same problem 
solving model could account for the performance of both groups. However, two 
additional types of problems were also used: ''unanswerable problems" such as 
"Speed =» 50 mph, Driving Time = 1 hour, Find Mileage", and I'quest ions" suctf a9 
"Given Arrival Tjhie arid Leaving Time what else is needed to find Total Time?" 
For^ these types of problems, which requi re checking many refations, the story 
subjects were much faster (e.g., twice as fast for the unanswerable problems) 
as compared to the letter group. Thus, this study provided our first hint that 
the story format provides for a more integrated memory representation of the 
four equations and/or faster search speed when many relations must be tested. 

If the story subjects .tend to integrate the information with their ex- 
isting knowledge, as is suggested by theprevious result, one prediction is 
that they should be less influenced by the presentation order of the equations. 
In order to test 'this idea, another study (Mayer £ Greeno, 1975, Experiments 3 
and k) was conducted in which subjects learned 9 interlocking .equations pre- 
sented either as meaningful stories or as nonsense .equat ionsf Subjects learned 
3 equations to criterion, then learned a second set of three, and then learned 
a final se~t of three. On a subsequent test subjects were asked about" only 
three of the equations such as , W = 0 * D, M = 0 * H, R » W/T. Problems 
required ejther ng substitutions, such as "Given. M and H could you find 07", 
on£ substitution such as, "Given M, H and D could you find W?", or two 
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substitutions such as, "Given M, M, 0 and R could you find T7". For some 
subjects the 3 equations had all been learned In the same set (same set) and 
for other subjects the 3 equations came from 3 different sets during learning 
(separate sets). For the story subjects, performance was the same on problems 
requiring substitutions regardless of whether one or separate sets were Involved; 
however, for the letter group response time on substitution problems was much 
longer (e.g. twice as long on two-substitution problems) for the separate set 
presentation as compared to the one set presentation. Thus the story subjects 
appeared not to be influenced by the organization of presentation while the 
letter subjects were. This result provides replicatory support for the idea 
that the story format allows for a more Integrated memory representa t^l^n^ and 
thus faster performance on problems requiring search across several equation. 

Since the above results encouraged the idea that presentation format in- 
fluenced problem solving performance, a'subsequent series of experiments was 
conducted which investigated the same issue using a simplier and better ana- 
lyzed task — linear reasoning. For example, in one study (Mayer, 1978,- 
Experiment 2) subjects learned the remote pairs of linear ordering such as: 
F>H>R>B. For some subjects, premises were presented as a story such as: n The 
frog gets 10 times as many votes. as the rabbit. The hawk gets 20 times as many 
votes as the bear. The frog gets kO times as many votes as the bear. 11 Othj^r 
subject received Isomorphic equations in nonsense form, such as, M F =-10 * R, 
H = 20 * B, F = ^0 * B n . Subjects were asked to answer questions involving no 
substitutions such as "Is F>R?" ) involving one substitutions such as n ls F>H? M .^ 
and involving two substitutions such as 'Ms H>R?". If subjects remember the 
three premises and then use them to make inferences, the response latencies, or 
error rates should increase with the number of required substitutions. This is 
the pattern obtained and best fit by the story subjects, but it was not obtained 
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for Che Utter subjects. Tim latter group's data waa hast fit by a modal In 

which each term was tagged, and question answering Involved t«ig matching rather 

than making Inference*, Thus, In this case problem format Influenced both tha 

mode of representation and the problem solving procedure. 

The previous two sots of studies highlighted the negative aspects of 

equations I.e., requiring more time In complex situations, and encouraging a 

superficial solution strategy. However, the present experiment uses a task In 
« 

which equations may hold an advantage; first, a single long equation is used 
rather than a list of subst I tutable equations, and second, the necessity of 
real-world knowledge Is minimized. I>fcthis case, equations may hold an 
advantage over word format because they require less memory load (Hayes, 

■ 3 

1978) : 

Problem Space • • .j 

for example, consider the operators that a person uses to solve a word 
problem such as the following: 

Find a number such that If 8 more than 3 times the number is 

divided by 2, the result is the same as 11 less than 3 times 

the number. 

For purposes of this paper, we will call this an algebra word problem, and^ 
subjects who are asked to solve these types of problems will be called the word 
group. 

An isomorphic way of stating the problem in equation form is: 
(3 + 3X)/2 = 3X - 11 

v 

We will call this format -an al^ebr^a equation problem, and subjects who are 
asked to solve these types of problems will be members of the equation group. 

1 u ■ 
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Thsra (H'c? two hf|t»lc classic*;* of ac,tl'-w> Unit Offin lu* ustul ui g<io«ii;ata 
problem states In Chh problem; 

MQVEs a varlabla or number Is movotl from on« side of tha 
equation to tbcs othar by a ctd<||clon 9 nubsicractlah, multi- 
plication, or dlvlslon of both a'N«a by tha st^'valua, a»d 

COMPUTES combining two consecutive numbers or two Instances 
of fhe same variable on one side of the equation by' 
carrying out a computation such as addition, subtraction, 
multiplication or division. 
For example, given the problem state, "N 

8 « 6X - 22 - 3* 

a MOVE operator would be to add 22 to bo<th sides, yielding the new problem 
state, 

8 + 22 « 6X - 3X 

An alternative operator that cgu^ d be app l l e d to the problem state, 

8 « 6X - 22 - 3X . ■ 

Is the COMPUTE operator; since there are two variables on one side of the 
equation they can be combined, yielding the new problem "state, 

8 =* 3X - 22 

Each problem state may be characterized by Indicating the minimum number 

4 

of MOVEs and of COMPUTES that would have to be applied in order to move from 
the given state to the end state (i.e., X *» a number). For example, the 
initial state, 

\s + 3X^/2 = 3X - 11 \ 
requires 5 computations and k moves to generate a value for X., Thus it is 
labeled as problem state 5^; the first number indicates the number of required 
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computations and the second number gives the number of requi red moves.. If one; 
KOVE is made, such as to multiply both sides by 2-, the resulting, state, 

8 + 3X - 2(3X - 11) ^ 
requires 5 computations and on1y?moves; thus, it Js labeled as problem state 
*53. Further, if the ^COMPUTE operator is applied twice on the right -side of the 
equation, the resulting i^gfce, 
8 + 3X - 6X - 22 

requires only 3 computations and 3 moves; thus it is labeled as problem state 
33. The remaining 3 moves are to get all the X's on one side, get all the 
numbers of the other side, and then to divide both sides by the number of Xs; 
the remaining 3 computations are to add 8 and 22, to subtract 3X from 6X, and 
to carry out the final division 30/3 to get a value for X. 

Table 1 gives a list of -Some of the possible problem states, and labels 
each according to the minimum number of MOVEs and COMPUTES that are required. 1 
Labels with prime marks (') indicate that they have the same number of MOVES 
and COMPUTES as some other non- i den t i ca 1 state. For example, 

8 + 22 - 3X 

requires a COMPUTE (add 8 and 22), a MOVE (divide both sides by 3) and a 
COMPUTE (divide 30 by 3). Similarly, 
30 - -6X - 3X 

requires a COMPUTE (subtract 3X from 6x) , a MOVE (divide both sides by 3) and a 
COMPUTE (divide 30 by 3). Thus both problem states require 2 COMPUTES and 1 
MOVE, and are labeled as problem 21 and 21' respectively. * 



Insert Table 1 About Here 
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There is more than one single path to the solution state. For example, 
one could move from state 53 to state 52 by apply ing a MOVE operator," or one 
could move from state 53 to state 33 by applying COMPUTE operators. A partial 
problem space is given in Table 2. The given, state (5*0 is on the left and 
the goal state (00) is on the right; thus, any change in state to the right 
represents progress towards the goal. The problem space does not include any 
state with a fraction and does not include backwards moves from state 54. 
Table 2 consists of number pairs-representing problem states — connected by 
labeled li nes-- represent i ng MOVEs or COMPUTES. The first number indicates the 
number of required COMPUTES and the second number indicates the number of 
required MOVEs , as is shown in Table 1. The M above a line refers to the 
application of a MOVE operator and the C refers to the application of a COMPUTE 
operator. For example, one sequence of problem states is 54,53,52,51,31,21,11,10,00; 
another possible sequence is 54 , 53 , 33 , 32 , 22';/21 , 1 I f 1 0,00. As can be seen, not 
all of the 14 states shown in Table 2 are needed for any given solution path. 

i 

Insert Table 2 about here 



In the present experiments, however, subjects were asked to solve problems 
which began at each of the 14 states listed in Tables 1 or 2. Some subjects 
were given problems in word form (word group) and others were given problems 
in equation form (equation group). Thus the resulting data is a 2 x 14 array 
showing the response latencies for each group by problem state. 
Contrasting Versions of a Two-State Model 

Simple Two-State Model . The simple two-stage model states that solving 
an algebra word problem involves two s ta tes t rans lat ion plus solut ion--whi le 
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solving an isomorphic algebra equation involves only one state — solution. In 
addition, it is assumed that the solution processes are identical for corres- 
ponding word and equation problems; the o^t]y difference between subjects* 

solving word and /equation problems is that word problems require a translation 

ft 

and equation problems do not. / 

/; 

Finally/it is assumed that translation requires a 

2 

constant amount of time regardless of problem state. 

According to this versioji of the two-stage model,, the response time to 

answer a word problem is represented as, 

RT (p) = RT + RT (p) 
w r t s 

while the response time ta answer a corresponding equation problem is, 

RT e (p) = RT s (p) . 
where RT t is the time to go from words to an equatioTi for all problems, RT s (p) 
is the time to go from an equation to responding with the correct answer for 
problem p, RT is the total time to solve a word problem p and RT fi is the 
total time to solve a corresponding equation problem. 

This model allows for a straightforward prediction concerning differences 
in performance of the two treatment groups on each of the 1 problem states: 
There should be an overall main effect in which the equation group is faster 
than the word group , and since this di f ference i s constant over all probl em 
types there should be no interaction between treatment and problem state. 

Modified Two-Stage Model . The simple two-stage model makes an assumption 
that translation time is a cons tan t* re^a rdless of probl em state. In orde r to 
eliminate this dubious as sump t i on , a mod i f i ed two- s t age model is i n t roduced to 
this section. The modified two-state model is identical to the earlier model 
except that translation time varies as a function of problem state. Like 
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the simple two-state model, it is still assumed that the solution process is 

4 

the same for both types of pj^blems. r ^ 

The solution time to sol-ve word problem p may be represented as follows, 

RT w (p) 53 RT t (p) + RT s (p) C • " ' 



and the time to sblve a -correspond i ng equation problem is, 
RT e (p) = RT s (p) 

where RT^tp) is the time'to ti^staTe^a given problem into an equation, and 
RT s (p) is the time to produce a correct response for an" i somorph i c equation. 

This model also predicts an overall main effect in which the equation 
group performs faster than the word group. .In add i t i on , *th i s model also pre- 
diets an interaction between treatment and problem state. The form of the 
interaction, however, is that the difference in latencies should increase as a 
function of problem length (i.e., as the complexity of the translation process). 
Thus, word problems requiring much transaction should generate higher differ- 
ences from corresponding equation p rob 1 e$ij|Phan word pr*ob 1 ems t ha t are short. 
Since there are several ways of defining problem length, thesejwil'l be dis- 
cussed in the results section.^ 

Different Solution Procedures . The previous two models have been based on 
the idea that responding to algebra word problems involves translation plus 
solution, and that 'the solution process for word problems is the same as the 
solution process for equation problems. An alternative presented in this 
section is that the solution process for a word problem is not the same for its 
corresponding equation problem. 

According to this model the latency for responding to a word problem may 
be represented as follows. 

RT (p) = RT. (p) + RT (p) 

W L sw 
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and the time to'solve a corresponding ^guat ion .problem is, 

• • •$ 

, RT e (p) = RT se (p) 

where RT t (p) is the time to translate d problem p, RT sW (p) is the time to solve 
the problem using the word-solution procedure, and RT se (p) is : the time to solve 
the problem using the equation-solution procedure, z 

Since this model assumes qualitatively different solution procedure for 
the ,two treatment groups, it is important- to specify the nature of the differe- 
nces. The nature of the differences can be conceived in terms of differences in 
the conditions that are attached to the possible actions (i.e., MOVE and COMPUTE) 
In general, problem solvers in this task report two general types taf goals: 
(l) Reduce expres s i on- -r t ry i ng to carry out any indicated operations Or clearing 
any parantheses as soon possible, and (2) Isolate variables — trying to move, 
air the tfs on one side and move all the numbers onto the other side. It may be 
noted that the conditions for reduce expression involve looking at orlly one 
side of the equality such as noting whether there is a parenthesis or whether 
there are two numbers on one side (i.e., 8 + 22 is on one side in problem state 
21);. in contrast, the conditions for applying operators with respect to the 
isolate goal always involve looking at both sides of the equation such as 
noting whether there is an X on both sides. 

For purposes of this paper, it is reasonable to suppose that equation 
format requires less memory load than word presentation format. Since word 
format is cumbersome, it is more likely that the controlling strategy of 
subjects in this group is to reduce the expression. Since the equation allows 
for a unified visual representation of the entire problem, it is more likely • 
that the controlling strategy of subjects in this group is to isolate the 
variable. Specific production system models which are based on these 
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difference in the conditions will be, discussed in the next section of this 
paper, and are presented in detail in a companion paper (Larkin & Mayer^ I 98q ) . 

Based on these differences it is pcksible to predict differences in. the 
pattern of solution performance that must result in a 'treatment' * problem state 
interaction. In particular, the equation group should show evidence of planning 
and of setting "isolate variables" as its tdp goa) while the word group should 
show a pattern of performance that does not involve planning but rather sets as 
^tS goal to "reduce expression". 
Reduce Versus Isolate Strategies 

In this section we present a more formal model wh i ch out 1 i nes the differ- 
ences in performance between the equation subjects—who presumably use an 
isolate strategy — and the word subjects—who presumably use a reduce strategy. 
As a first step, Table 3 .presents a general condition-action list (or means-* 
ends table). The left side of each statement gives a general descr ipt ion of a 
situation (or difficulty) that may be encountered in solving the algebra 
equation, and the right side gives a genera-1 descrfption of the actjon^o be 
taken. Thus^the condition-action pairs listed in Table 3 are much more general 
than those needed for a running program but they are consistent with Newell S 
S imon (1972) system and will a 1 low us to .derive some predictions in this 
experiment. ' ^ 

We assume that a subject, represents the information in a fown that will 

' t - 

allow for testing of the s i fl x conditions listed in Table 3. When a subject is 
given a problem in solne state p, the subject searches for the cond i 1 1 on %^^^ 
are met. If more than one condition is met, 'the equation subjects will cnoose 
one to work basad on the priority ordering, 1-1, 1-2, R-1 , R~3, R-A; similarly, 
the priority ordering for word subjects is R-1, R-2, R^3, R-^, 1-1, 1-2. 



\ 

I 



1 <; 



Strategies for Algebra Problems 

. V • i6 ' 1 • 

Another way to state this di f ference 'fs to" say that equat i orrs-stfSjecft look 
first for isolate conditions and Word subjects look first for reduce conditions. 
In the present experiment, there are several cases in which actions associated 
with isolate conditions (l-l and 1-2) cannot be carried out without further, 
setting o^ subgoal s. However, if subjects use the reduce strategy it is never? 
necessaryL_tp stack subgoal s. 

Ah example of these differences is given in Tables h and 5. for the 
problem in state 5^, three conditions are nj^t: there are Xs on both sides (l- 
1), there are numbers on both sides (l-2), and there is a parenthesis (R-3). 

n 

Equation subjects select 1-1 as the condition-pair to be executed, but when 
trying to apply the action operators find that there is a const raint--i .e. , 
there is a parenthesis. In order to remove this constraint, a new subgoal must 
be stacked on top of the 1-1 subgoal, and so on v Afe can be seen, problem 5h 
involves 2 failures to carry out a subgoal. (i.e., 2\ instances of subgoal 
stacking), problem 53 requires 1 instance of subgoal stacking, and 33 requires 
■^Ljione. For word subjects, there is never a failure to carry qut a subgoal and 
thus no neeB for subgoal stacking. The results of applying the reduce and 
isolate strategies to all 1*4 problem spates is summarized in Table 6. In order 
to predict response latencies it is necessary to predice that each move re- 
quires some constant amount of tim^, each computation requires some constant 

amount of time, and each instance of goal stacking (planning) requires an 

k — 

additional amount of time. 

Like the modified two-stage model, this formulation predicts an inter- 
action between treatment and problem length. However, unlike the previous 
model, the pattern of interaction should reflect different solution strategies. 
In particular, the interaction should reflect the fact that there is goal 
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stacking r^uire^for problem states S 2 * (ZVime^, 53 (I time) , and 51 (l time)* 4 
for the equation grqyp but not the word group- 



4 t 

Tables 3, 5, 6 about here 



pi Experiment 1 

Subjects solved problems in each of the 1A problem states jivith some ' 
subjects receiving all problems in word format and ''*ffi e sub|el|t^ receiving 
them In equation format. Experiment 1 was conduct^dj^jTi order to test the 

/^JI0- ' % ■ 

predictions of each of the models discussed &bv&k/Zlhe :^>I»pl e two-staqe model 



predicts an overall main effect but najniferM two-stage 



model predicts an overall main ef fecfe^^v^ increasing 
difference as a function of problem lengthV VFinal ly, if the subjects in the 

- ••: . 

groups use qualitatively different solutionvstrategFes such as i sol ate versus 
reduce, more specific d i f f erences *i n the/goat stacking procedures can be 
described. / s . \ „ 

Method : # \ 



< Objects and design . The subjects were kl college students recruited from 
the Psychology Subject Pool at the University of California, Santa Barbara. 

Twenty-one subjects served in the equation group and 21 subjects served in the 

+ 

word group. All subjects solved the same \k states of problems so state of 

\ * 
problem is a within-subject factor. 

Materials and apparatus . Two sets of 98 problems were constructed. The 

equation set presented the problems lin equatiorvform such as: 

(8 + 3X)/2 = 3X - 11 1 

The word set presented problems identical to those in the equation set but 

used words such as: 
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Find ^;number 4 ^^^ : f 8 more tharO times tte number is-divided, by Z 
w the resul ts/^s ^the same a^ 11 less th&h 3 times the number. 
The 98 problem* were generated using a 7 by design. The first factor in- 

dicates that seven different^ problems were used swch as the one indicated 

i « . 

^bove. All problems were of the samfe form ^ut spe^fic values were different. 
The second factor indicates that each problem Was broken down into l*t different 
problem states by manipulating hd^ many computations and moves had to be made. 
Table 1 shows the \k states of each problem, and Table 2' shows a partial problem 
space. 

the apparatus consisted of an AOM-3 terminal screen and keyboard for 
presentation of the stimuli- ahd entering of responses; an ADM-3 terminal that 
was u^ed as an experimenter monitor; a Micropol i s dual floppy disk drive used 
for storing the stimuli, the experiment program, and the data; a Teletype 
printer used for printing out the data; and an Altair 8080 computer used to 

( • 

control these devices. 

Procedure . Each subject was run individually in a sessiorv that lasted 
approximately 30 to 60 minutes. First, subjects were given verbal instructions 

to make sure that they could solve algebraic equations similar to those in the 

t 

experiment. Three subjects were unable or unwilling to solve the trial equa- 

t 

tion, so new subjects were run in their places. Instructions stated 4 that an 
equation (or a word problem) would be presented on the screen. The subject was 
to logically solve the problem for the unknown, and then press the corres- 
ponding button on the keyboard. The buttons were labeled from 1 to 30 by 
integers and included one button labeled "none." Subjects were explicitly told 
not" to guess or to use any shortcut method; rather they were told to carefully 
work out the solution step by step and to press the answer button only when 
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they were sure of their answer.^ Subjects were allowed 60 seconds to.answer; 
as soon as the subject answered, or as soon as 60 seconds passed", the question • 
was removed from the screen and a new question appeared in 2 seconds. Each new 
question was preceeded by a beep. 

The questions were presented in random order except for the constraint 
that each set of 14 items Consisted of all 14 problem states and that each set 
of 14 items contained 2 different instances of each problem- After each set of 
14 problems, subjects were given a brief rest period. / 

First a set of .1** practice 
items were "given, one at a time. Then seven sets of 14 experimental items were 
given. At the end of the experimental session subjects were asked about their 
solution strategies to make sure that subjects had followed directions. 
Results and Discussion 

» 

The average response time for each of the 14 problem states was computed 

** . 6 
for each subject in each group. Table 7 summarizes the meah response time for 

^ach of the two groups by problem state. An analysis of variance was performed 

on the data with treatment as a between-subject factor and problem state as a 

9 

with-subject factor. As expected, the ANOVA reveal ed ^that the equation group 
was significantly faster than the word group in overall performance, F(l,40) = 
71,244, p < .001, and there were overall differences In response time for 
different problem states, F (1 3, 520) = 197.11, p < .001. 

The main focus of this experiment, however, is on the question of whether 
subjects in the two treatment groups followed different solution procedures. 
The "translation plus solution" theory states that subjects in the word group 
simply translate the problem into an equation and then solve the equation in 
the same was as the equation group. A simple version of this theory assumes 
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that translation requires a constant amount of time? for each problem, and.thus 
predicts that the word group should take longer than the eqqation group overall 
but there Should Be no interaction with problem state. The ANOVA revealed a 
significant pattern of interaction between treatment and problem state, F(l3, 
520) = 17.78, p < .001, and thus provides evidence against a simple version of 
the "translation plus solution theory. 11 



Insert Table^7 about here . 



These results. allow one to reject a simple version of the "translation 
plus solution 11 theory. However, a modified version of the theory predicts 
an interaction if its assumed that the translation stage takes longer for 
longer problems.. This modified version of the two-stage theory predicts an 
interaction between treatment and problem state in which there is a monoton- 
^ically increasing difference as problem length (indicated by number of compu- 
tations ar^i moves) increases. The difference in response time (in seconds) 
between the two treatment groups for each of the 14 problem states are listed 
in the last row of Table 7. ^A linear regression 3 using number of steps (i.e., 
computations or moves) as the independent variable and the difference in RT as 
the dependent variable produced an R 2 of only .65, and a multiple regression 

using number of computations and number of moves as the independent variables 

** 2 

and the difference in RT as the dependent variable produced an R of only .70. 
These results indicate that the treatment by problem state interaction cannot 
be neatly described as consisting of monotonical ly increasing differences, if 
problem difficulty is defined as a function of the number of steps. 
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An alternative theory that predicts an interaction between treatment and 
problem state is that the two groups engage in qualitatively different solution 
procedures. For example, the data presented in Table 7 suggests that there are 
three general stages for the equation group indicated by a-sharp drop from 
state 5^ to 53, and from each of the 5-computat ion states (53, 52 or 51) to/fts 

L 

respective 3-computat iorl state (33, 32 or 31)- One hypothesis developed earlier 
is that the equation subjects may plan ahead and stack goals; for example, they 
can stop at state 5^ and see several moves ahead, at the other 5-computat ion 
states they can see several computations ahead, and at the 3-computat ion states 
they can see to the end of the problem. The three major jumps in response time 
suggest that subjects are able to form subgoals (Thomas, 197*0 • However, the 
word subjects do not show the same subgoal pattern; for them each additional 
computation or move tends to add a constant amount "to solution time. 

In order to provide more information on this observation, several multiple 
regressions were fit to the means for the 1*4 problem states for each group. 
These are summarized in Table 8. First, a simple linear regression was used 
with the^ i ndependent variable being* number of steps. (A step was defined as 
either a move or a computation such that 5^ state required 9 moves, and state 

10 required l). As shown in Table 4, the model fit the word group reasonably 

2 1 
well (R =» .95) but did not fit the equation group well (R = .83). Second, a 

multiple regression was used with the independent variables being number of 

computations and number of moves. As shown in Table 8, this model fit the word 

group slightly better than the one-variable model, but did not fit the equation 

group well (R = .8k). Finally, a multiple regression was used that included 

three independent variables: number of computations, number of moves, and stage 

level. Problem state 5k was defined as stage level 2, problem states 53, 52 
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and 5] were designated as stage level 1, and al 1 the lower states were designated 
as stage level 0. • This model resulted in no improvement of fit for the word 

group, but did result in an enormous improvement of fit for the equation group 

2 - A 

(R ° .99). Thus, the equation group was best fit by the three variable (or 

stage model) while the word group «vyas best fit by a simple step model or two 

variable step model. These results suggest th^t the pattern of /interaction 

between the two groups cannot be adequately described by a simple or modified 

"transl.at ion plus solution 11 ttfSbry. In contrast, there is evidence of different 

processing strategies as indicated by the fact that the group response patterns 

are best fit by different models a planning or stage model for the equation 

group and a step model for the word group. 

insert Table 8 about here 



The previous analyses explored trends in the group means. An additional 
analysis was carried out to determine whether the individual data encouraged 
the same conclusion as the group data. Each of the three regression models- 
one variable, two-variable and three-variable -- was fit to the pattern of mean' 

response latencies for H problem state for each individual subject. Table 9 

2 

shows the value of R , indicating the goodness of fit, for each model applied 

to each of the kl subjects. Table 10 shows the number of subjects in each 

group, (n = 21) who were best fit by each regression model; the top portion 

2 

(strict criterion) of Tab 1 e 10 def i nes n best fit 11 as yielding a R value that is 

more than .01 higher than the next simpliest model, and the bottom portion 

2 

(linient criterion) defines "best fit 1 ' as yielding an R value that is more 
than .03 higher than the next simpliest model. As can be seen, 20 of the 21 
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subjects in the' equation group are best fit by the 4 three-variable (or planning) 
model, while most the c fc^£_word v< group subjects are best fit by the one or two- 
variable (or step) models. Ohi-square tests were conducted for the data in the 
top and^btfttom portions of Table 10; there was a significantly different 

class i fic^t Lon pattern for the two treatment groups using the strict criterion, 

2 2 
X * 20.88, d£ =» 2, £ < .001 , and us i ng the 1 inient cri terion, x = 2 5-0^ f 

df = 2, £ < .001. Thus these results are consistent with the conclusions 

presented on the basis of group data. 



Insert Tables 9 and 10 about here 

A final analysis was conducted in order to determine whether the differ- 
ences between the treatment groups was an artifact of the many repeated trials 
in the experiment. For example, since each of seven problems was repeated in 
\k different problem states, it is possible that subjects by the end of the 

\ 

session learned to respond on the basis of distinctive cues in the problem 
rather than actually computing an answer. It should be pointed out that the 
instructions clearly stated that the subject should compute the answer rather 
than try to find ways of guessing., and further, that subjects indicated that 
they had followed directions when questioned after the experiment. However, in 
order to provide more data on this question, an analysis was conducted using 
data from the first experimental trial (there was a practice trial before this 
trial) and the last experimental trial; each of these consisted of one of each 
of the ]k problem states with all problems counterbalanced. The mean response 
latencies for each group on each of the problem states occurring in the first 
and last experimental trials is shown ?n Table 11. 
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As can be ; seen in comparing the pattern of the equation and word groups, 
the same general pattern is. obtained for both the first and last trial. In 
both cases, the word group shows a pattern of consistently more time required ♦ 
for each new step while the equation group shows a jump from 5** to 53 and from 
the 5-computation state to its corresponding 3-computation state. * These 
patterns are similar to those presented and discussed in conjunction with Table 
3. An analysis of variance was conducted using treatment as a between subjects 
factor and trial and problem state as within subject factors. As expected, the 
equation group was faster overall, F(l,13) » 75-23, P < -001., and performance 
on the last trial^was faster.' than on the first, F(1,A0) - 104.95, P < -001. 
In addition there was a pattern of interaction between treatment and problem 
state that was similar to that shown in Table 7, FO3.520) - 9-42, p < .001. 
There was, however, no evidence that the treatment x problem state interaction, 
was different for the first versus the last trial; this observation is con- 
sistent with the failure to obtain a three-way interaction involving treatment, 
problem state and trial, F(13,520) -1.05, n.s. In addition, a separate AN0VA 
was performed on the data for the first trial and a separate AN OVA was per- 
formed on ^e data for the last trial with treatment and problem state as the 
factors. In both cases there was the same significant treatment x problem 
state interaction; for the first trial data, F(13,520) = 7-03, P < -001; for 
the last trial data, F(13, 520) = 4.81, p < .001. These results provide evi- 
dence that the differences in performance between the groups was not based on 
an artifact of the design that allowed for learning of a short cut as the 
session progressed. 
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Experiment 2 

The results of Experiment 1 suggest that the solution process used by 
subjects in the equation group involves planning. However, the translation 
and solution process of the word group does not show the same - sort of planning. 
One reasonable conclusion is that the two groups use qualitatively different 
solution processes, as represented by the isolate and reduce strategies 

In order to provide another test the impl ications of the models discussed 
above, a second experiment was conducted in which one group wrote a numerical 

' - 4 

answer for each word problem (word-to-solution group), and another group wrote 
an equation for each word problem (word -to- equation group). According to the 
two-stage model performance of the word-to-solution task involves the following: 

RT w-to-s " R V p) + R VfP> + RT *(P> 
and the performance of the word-to-equation group involves: 

RT w-to-e " RT fc, (p) + RT « (p) 
where RT t (p) involves the time to go from the word problem to a statement of 

c 

the underlying equation, RT^nyolves moyjng from the equation to a numerical 
answer (a process that requires plann^g when performed separately) and RT w 
involves the time to write an equation or a number on a sheet of paper. The 

8 

previous experiment suggests that the RT g component involves planning. 

The previous experiment demonstrates that when the solution stage is 
performed alone (i.e., for equation problems) there is evidence of planning. If 
there- are two independent stages in solving algebra word problems, and if the 
solution stage requires planning, then the performance of the two groups in 
Experiment 2 should differ. The two-stage model predicts that the word-to- 
solution group should show a pattern involving planning— since a solution 
process is required—while the word-to-equation group should show a pattern in 
which performance depends only on the length of the problem. 
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As an alternative it may not be possible to cjeanly separate the trans- 
latlon and solution processes for word problems. In this case, both groups 
will show the same general pattern of performance since both groups engage in 
the same step-Jp^rsteP" translation process; however, since each step requires 
more computation for /the word-to-solution group then each step should take more 
t ime. 
Method 

Subjects and design . The subjects were k2 college students from the 
University of California, Santa Barbara. Twenty-one subjects served in the^ 
word-to-sol ut ion group and 21 served in the word-to-equation group. All 
subjects solved the same \k states of problems so state of problem is a within 
subject factor. 

Materials and apparatus . The same 98 word problems and the same apparatus 
were used in Experiment 1. 

Procedures The procedure was similar to that used in Experiment 1, except 

that subjects were asked to write their answers on a sheet of paper and then 

press any button on the terminal. For the word-to* equation subjects, the task 

was to write down an equation — using X as the unknown — to represent the 

» 

word problem. For the word- to- solution subjects, the task was 
to write down a number for the answer- All subjects were allowed 

to circle their final answer. In addition, subjects were told to press any 
button on the keyboard immediately after circling the answer they had written. 
Each subject solved 1** practice and **2 target problems; these were presented in 
sets of H each and were counterbalanced as in Experiment 1. 

R esul ts and D i scuss ion 

7 

The average response time for each of the 1** problem states was computed 
for each subject in each group, as in Experiment I. Table* 12 summarizes the 

o n 



Strategies for Algebra Problems 

i 

27 

mean response time for each of the two treatment groups by problem state, and 
is comparable to Table 7 for Experiment 1. K As in Experiment I, 9n analys.is of 
variance was performed on the data with treatment as (a between-subjects factor 
and problem state as a wi thin-subject factor. The overall mean < response time 
for the two treatment groups was identical (23.3 sec) so the overall main 
effect for treatment produced an F of zero. As expected thefe was an overall 
main effect due to problem state, reflecting the fact that longer problems 
required more time, F(l 3,520) 226.1 , p < .001. 

The main focus of this experiment was on the pattern of performance of the 
two grojups by problem state. Both groups appear to show a pattern that is 
similaW£o that displayed by the word group in Experiment 1 — monotonical ly 
increasing response time as a function of problem length. However, the treat- 
ment by problem state interaction is statistically significant, F(13,520) = 
1^.18, p < .001. The interaction can be described by saying that the word-to- 
solution group is faster than the word-to-equation group on simple short 
problems but the word-to-equation group is faster than the *word-to-sol ution 
group on long problems. This interaction is consistent ~wi th the idea that 
translation to equations is not needed and may interfer with problem solving, ^ 
especially for problems that do not require much planning. 



Insert Table 12 about here 

f 



In order to better understand the nature of the treatment by problem sfrS^te 
interaction, several muliple regressions were performed. The goal of these / 
analyses was %o determine which models best fit the performances of the two 
groups. In Experiment 1, the performance of the word group was best described 
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by a simple one-variable or two-variable step model while the equation group 
was best fit by a three-variable planning model. The group means by problem' 
state were fit to each of these models, 'as in Experiment 1; in addition, another 
one-variable model was tested in which the. number of calculations served as the 
independent variable. 

The two-stage model (or translation plus solution theory.) predicts that 
the word-to-solution group will demonstrate a solution process that uses 
planning heuristics — as indicated by a fit to the three-variable planning 
regression model while the word-to-equation group will demonstrate a linear 
step-by-step process -- indicated by a fit- to a one or two-variable step model. 
However, the alternative model predicts that the two treatment groups will show 
similar patterns of performace — both being fit by a simple' step model. 

The results of these analyses are summarized in Table 13- As can be seen, 

both groups were best fit by simple one-variable or two-variable step models, 

and neither group required the planning model that characterized the equation 

group in Experiment 1. Apparently, both groups performed more like the word 

group in Experiment I tha* like the equat ion group . For example, a comparison 

/ 

of Tables 8 and 13 shows that the word-to-solution group replicates the general 
form of performance of the word group in Experiment 1. The word-to-equation 
group shows a similar trend except that each additional step requires less time 
than for the word-to-solution group. One interpretation that is consistent 
with these findings is thatAoth groups engage in segment-by-segment trans- 
lation process but the word-to-solution group must also*perform some operations 
on each newly translated segment. 



Insert Table 13 about here . 
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A more detailed Inspection of the results shown in Table 13 shows the 

*• 

following* (l) The Intercept for the word-to-solution group is higher than for 
the word-to-trartslation groups; this is y consistent with the idea that the 
^wrlte P rocess ' takes longer* for equations which consist of many symbols than 
for writing a single number. (2) The time required for acWftional calculation 
steps is greater for the word-to-solution group; this is consistent with the 
idea that this group must actually make a computation as part of the ongoing 
translation/compacting process while the o.pher group does not. Further, the 
number of moves required increases the solution time for the word-to-solution 
group, presumably because each move must actually be executed as part of the 
translation/compacting process. However, the number of moves does not increase 
the required time for the word-to-equation group and in fact serves to decrease 
it slightly; this is consistent with the id^a that the number of moves does not 
increase the number of segments that must be translated. For example, problem 
states 5*t, 53, 52 and 51 do not differ in terms of the number of variables and 
relations that must be translated. 

) 

Supplemental Study v 
The previous studies suggest that equation format allows goal stacking for 
long problems. However, this conclusion is based only on the' pattern of 
response latencies. In order lb provide additional data on the problem-solving 
process of subjects, an interview study was conducted. 

The subjects were eight college students from the 
Introductory Psychology Subject Pool at the University &f 
Pittsburgh , with four subjects in the equation group and 

• s 

four subjects in the word group. 



Subjects were randomly assigned and run individually. 
Each subject was asked to solve t;wo of the state 54 equations 
used in previous experiments. For the equation treatment, 
an equation was printed on a chalkboard by the experimenter. 
The subject tyas asked to "think aloud" and tell the experimenter 
what to do with the equation. Each action that the subject 
called for was carried out by the experimenter writii^f a new 
equation on the board under the previous one. When a subject 
finished a problem,, he/she was asked to go back to the first 
step of the problem; in particular, the subject was asked 
whether he/she first thought about 11 getting rid of the 
parentheses" or about "getting the Xs on one side". A^imilar 
procedure, was used for the word treatment, except that the 
experimenter wrote the problem on the board in word form. The 
experimenter transcribed all of the problem states generated 
on the chalkboard and all of the subjects' comments concerning 
search for conditions; the entire session was also tape recorded. 

The main interest in this study is to determine whether 
there is any evidence that equation subjects engage in goal stacking 
Two of the four subjects in the equation group gave clear 
evidence of goal stacking but none of the word subjects did. 
For example, the solution process and. comments of one of the 
equation subjects is given in Table 14. As can be seen, this 
subject sets "isolate X" as a major goal but this strategy leads 
to several failures. Of the two subjects in the equation 
group Who did not show signs of goal stacking, one used fractions 
and one gave a fast, textbook proper description. However, 
even the subject who used fracticys gave a hint of the "isolate 
strategy"; "I wanted to get X <?n one side, but to do that I had 



to do something with that fraction.' Separate 8' .from 3X and 
remembering the basic rules of algebra you can't subtract them' 
out, when they are over a fraction. You have to separate them." 
Thus, this subject also expresses a goal stacking approach in 
which moving X cajjjiot be accomplished until it is separated 
from the fraction:,* None of the word subjects gave any evidence 
of setting "isolate X M as a goal, or 'of goal stacking.' 1 r 

In addition, several . equation subjects expressed difficulty 
in using the slash (/) as a division symbol. Hayes (1973) has 
found similar evidence for the role of spatial factors in solving 
equations. Many subjects expressed difficulty in verbalizing \ 
their thought process, and -there i s reason to believe that this 
task is not condictive to protocol analysis. However, the 
results provide support for the reality of go^l stacking in Hi 
equation subjects. 



Insert Table \k about here 



'Gene ra 1 Cone 1 us i on 




This study provides nqw informat ion concerning" the problem solving process ^ 
for algebra word problems, ' First, Experiment 1 compared solving word problems^ 
(which presumably require separate translation and solution phases) with solving 
corresponding equation problems (which presumably require only the solution 
phase). Results indicated that^ is was. not possible to characterize the be- 
havior of the word problem solvers as consisting of a translation phase followed 
by a solution phase like that of the equation problem solvers. Rather there 
was evidence, consistent with earlier results by Mayer S Greeno, (1975) that 
the solution process was qualitatively different for the two treatment groups. 
The word problem solvers displayed a pattern of monotomi cal ly increasing response 
increasing response times as a function of steps, as previously noted by Loftus 
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& Suppes ( 1 9 7 ) ; the equation problem solvers displayed heuristic planning 
similar In some respects to those suggested by Bundy (Note 1), 
^ Second, Experiment 2 compared the processes Involved In translation from 
wpi^ds to equations and the processes Involved In translation from word to 

answer. Solving word problems produced a pattern of behavior that Indicated a 

'M 

^modified version of tranJation alone. There was, again, no evidence that the 
solution phase involves the planning procedures as produced by the equation 
group in Experiment 1. Thus these two studies cast serious doubts on the 
applicability of the two-stage model of algebraic problem solving to the 
current task. 

Finally, these results suggest directions for future research on how" 
humans solve algebra word problems. The results of these studies, as well as 
other studies comparing word versus equation problem solving (Mayer £> Greeno, 
1975;. Mayer, 1978a, 1978b), indicate that theories of problem solving based on 
solution of equations may be different than theories based on solution of 
corresponding word problems. Hueristic solution models must take the problem 
representation into account. The present results encourage the idea that work 
on comprehension of prose might be relevant to work on the solutJon of word 
problems. This Is so because the present results show that the comprehension 
process and the solution process are far more intertwined than was previously 
assumed. It seems likely that when subjects are given complex word problems 
that overload their working memories, they do hot rely on the straightforward 
translation-plus-solution strategy. Rather, they appear to rely on a successive 
chunking' procedure in which segments are translated and operated upon in a 
plece-by-piece way. Several researchers have pointed to the important role of 
working memory in prose comprehension (Kintsch & van pijk, 1978; .Britton, 
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Holdredge, Curry & Westbrook. 1979). Furthermore, Individual differences In 
working memory (Hunt, Lunneborg & LeWl^s, 1975) might be Important In encouraging 
the use of different problem solving strategies. 
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Footnotes 



This research was supported by Grant NIE-G-78-01 62 from the National 
Institute of Education. The author appreciates the helpful comments of James 
Greeno and Jill Larkin. This paper was written while the author was on sabbat- 
ical leave at the Learning Research and Development Center, University of 
Pittsburgh. Requests for reprints should be sent to: Richard E. Mayer, Depart- 
ment of Psychology, University of California, Santa Barbara, CA 93106. 

1/ Problem states involving fractions or involving backwards moves from the 

given state are not included. 
2. The strategy used in this paper is to begin with as simple a model as 

possible, and to add more parameters only as needed. Thus, we begin with 



/ a simplistic assumption that translation time is a constant, but later 
modify that assumption below. 
3. Based on the results of Experiment 2, the best indicators of 

translation difficulty are number of moves and computations required. 
A. The isolate strategy and the reduce strategy are not the only possible 
models but they both are the simpllest and correspond to the reports 
of problem solvers. In addition, the assumpt i on that all computations 
take an equal amount of time, all computes take an equal mount of time, 
and each instance of goal stacking takes an equal amount of time, are 
made in the interests of simplicity. !f there are gross differences 
between individual MOVEs or COMPUTES or STAGES then the fit of our 
models should suffer and we would be encouraged to add even more 
parameters. Fortunately, this is not necessary in the present experiments. 
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5. One strategy that we did not want our subjects to engage Is that of esti- 
mating values to plug Into X. The instructions emphasized the fact that 

A 

subjects should use logical deduction, and a post-experimental question- 
naire confirmed that subjects followed instructions. 

6. Error rates were low, and the distribution of errors was similar for 
the two groups. The equation group averaged k% errors and the word 
group averaged 7% errors with errors defined as not giving a correct 
response within 60 seconds. 

7. Error rates were low, and the distribution of errors was similar for the 
two groups. The word-to-solution group averaged 5% errors and the word- 
to-eqjjatlon group averaged ]% errors. 
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Table 1 

Fourteen Problem States for (8 + 3X)/2 ■ 3X - 11 



Computations 
and Moves 



Requj red Problem State 

54 (8 + 3X)/2 » 3X - 11 

53 8 + 3X « (3X - 11) -2- 

52 8 - 2 (3X - 11) -3X 

51 8 + 2(11) => 2(3X) -3X 

33 | 8 + 3X - 6X - 22 

32 \ 8 = 6X - 22 - 3X 

32* 8 + 22 + 3X = 6X 

31 8 + 22 - 6X - 3X 

22 8 - 3X - 22 

22' 30 + 3X «■ 6X 

21 8 + 22 = 3X 

21* 30 = 6X - 3X 

11 30 = 3X 

10 30/3 = X 



• 0 




Note, - Problem states are represented by two digits In circles; the first digit Indicates the 
number of required calcuatlons, the second digit Indicates the number of required moves 
Operations are represented by letters next to arrows; M Indicates a move, C Indicates 
a calculation, CC Indicates two calculations. 
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Table 3 



Some Condition-Action Pairs for Solving Problem 5^ 



I 

Isolate ^Variable 



(l-l) Xs are both sides of the equation — > 

Move X to left side and combine with other X 

(1-2) Ns are both sides of the equation — > 

Move N to right side and combine with other N 

Reduce Expression 

(R-l) 2 Xs on one side of the equation — > 

Combine them 
(R-2) 2 Ns on one side of the equation — > 
^ Combine them y' 

(R-3) Parenthesis on one side of the equation attached to division — > 

Move divided term to other side of equation 
(R-*0 Parenthesis on one side of equation attached to multiplication --> 

Carry out the multiplications 



-1 KJ 



6 f 



Problem State 



4(54) 



(53)' 



(33) 



(22) 



(11) 



'I 



(00) 
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Table k 

Solution of Problem 5*» Using Isolate Strategy 

Events Calculation Move State 

3X - 11 (8 + 3X)/2 

Conditions: 1-1, 1-2, R-3 
Goal: 1-1 

j 

Fail due to PARENS (R-3) 1 
Goal: R-3 

Succeed I 
2(3X - 11) = (8 + 3X) 

Conditions: 1-1 , 1-2, R-k 
Goal: 1-1 

4 Fall due to PARENS (R-4) 1 
Goal: R-4 

Succeed 2 
6X -.22 - 8 + 3X 
/ Conditions: 1-1,1-2 

. Goal : 1-1 

, Succeed 1 1 

3X - 22 = 8 
. * Conditions: 1-2 

Goal: 1-2 

Succeed 1 I 

3X = 30- 

Conditions: 1-2 

Goal: 1-2 
" v Succeed 1 
<X =10. 



From Problem State 5*» 
From Problem State 53 
From Problem State 33 
From Problem State 22 
From Problem State 10 



5 
5 
3 
2 



2 
1 

0 
0 
0 
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Problem State 
(5*0 



(53) 



(33) 



(22) 



(11) 



(00) 



* Table 5 

Solutionr^Qf Problem $k Using Reduce Strategy 



Event 



3X - II « (8 + 3X)/2 

Conditions: 1-1, 1-2, R-3. 

Goal: R-3 

Succeed 
2(3X - 11) » 8 + 3X 

Conditions: 1-1, 1-2, R-** 

Goal: R-** 
. Succeed 
6X - 22 ■ 8 + 3X 

Conditions; 1-1 , 1-2 

Goal: 1-1 

Succeed 
3X - 22 » 8 

Conditions: 1-2 

Goal: 1-2 

Succeed 
3X = 30 

Conditions: 1-2 

Goal: 1-2 

Succeed 
* = 10 



Calculation Move Stage 



\ 



From Problem State 5*» 5 

From Problem State 53 5 

Frolfi \P rob 1 em State 33 3 

Frolta Problem State 22 2 

From Problem State 1 1 1 



k 
3 
3 
2 
1 



0 
0 
0 
0 
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Table 6 

Number of Moves, Computations, and Subgoal Stacklngs for 14 
Problem States by Two Solution Strategies 

re State Isolate Strategy Reduce Strategy 

COMPUTES MOVEs STATES COMPUTES MOVEs STATES 

5* 5 4 2 5 k o 

53 5 3 1 5 3 0 

52 5 2 1 5 2 0 

51 5 11 5 1 0 

33 3 3 0 3 3 0 

32 3 2 0 3 2 0 

32' 3 2 0 3 2 0 

31 3 10 3 10 

22 2 2 0 2 2 0 

22' 2 2 0 2 2 0 

21 2-10 2 10 

21' 2 10 2 10 

11 110 1 10. 

10 1 0 0 10 0 



) 



t 



Table 7 

\ 



Mean Response fk^ by Problem State for Two Treatment Groups —Experiment 1 
Treatment Problem State 





10 




11 


21 


21 


1 




22 


1 

22 


31 




32 


32 




33 


51 


52 




53 


54 


Equation Group 


2. 


7 


2.1 


3.1 


3. 


7 


k 


.6 


5.0 


5. 


0 


5.8 


6. 


1 


7.9 


12.6 


\k. 


7 


15.2 


25.0 


Word Group 


3. 


k 


k.o 


6.2 


10. 


7 


11 


.0 


13.8 ' 


14. 


6 


18.1 


18. 




20.7 


27. ^ 

J 


28. 


.8 


31-3 


3^.3 


01 fference 




7 


1.9 


3.1 


7- 


0 


6 




8.8 


9. 


6 


12.3 


12. 


3 


12.8 


li»?8 


14. 


, 1 


16. k 


9.3 



Note. - Main effect of treatment, p < .001. Main effect of problem state, p < .001 
Introduction of treatment and problem state, p < .001. 
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V . ,>: Table 8 " 

'■$ jValueiisfof f& and Variable Weighings for Three Multiple' Regressions Fit to Two Treatment Groups 



EQUATION 
GROUP 



ONE VARIABLE 

R 2 =■ .83 

STEP =2.63 sec 




COMPUTATION =3.02 sec 



Intercept a - ^ . ^0 sec MOVE » 2.0*» sec 

Intercept ■» _1 ».59 sec 



THREE VARJABLES 
R 2 ■= .99 

COMPUTATION » .69 sec 
MOVE = 1.^2 sec 
STAGE •= 7.25 sec 
Intercept => .90 sec 



WORD 
GROUP 




R " .95 

STEP = i».37 sec 

Intercept 0 3. 58 sec 



R 2 .98 

COMPUTATION =5.63 sec 
MOVE = 2.1»7 sec 
Intercept = 3- 9*» sec 



R* - .98 

COMPUTATION = 5.^2 sec 
MOVE - 2.*»3 sec 
STj\GE = .61 sec 
Intercept « -3.39 sec 



7 
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Table 9 



Individuals Ss 



Values of R 2 for kZ Subjects on Three Models ~ Experiment 1 

Word Groups Equation Group 



1 

2' 

3 

k 

5 

6 

7 
8 
9 
10 
11 
12 

13 
111 
15 
16 
17 
18 
19 
20 
21 




Regression T Regression 2 Regression 3 
(One Variable) (Two Variable) (Three Variable) 



Regression 1 
(One Variable) 



Regressjon 2 
(Two Variable) 



Regression 3 
(Three Variable) 



.881 


.930* 


.93** 


.926 


.969* 


.970 


.882 


.968* 


.972 


• 93 1 * 


.965* 


.967 


.902 


.902 


.982 


.699. 

.soP . 


.905* 


.911 ; 


.939* 


.978* 


• j j j 


.962* 


.96i» 


.648 




. 790* 


.935 


.9 1 »6* 


.9^7 


.895 




.937 


.922 


• 95 1 ** 


.95 1 * 


.890* 


.892 


.893 


.852 


.85^ 


.963* 


.862 


.892 


.921* 


.933 


M* 


.955 


.870 


.952 


.971* 


.911 


.968* 


.966 


.881 


.966* 


.957 




.9*»8 




.935 




.9k5 



- Asterisk (*) Indicates best fit using strict criterion. 



cr .4 



9 

frit 



.8*13 


\ .870 


.913* 


.635 


.685 


.825* 


.707 


.785 


.869* 


•753 


.758 


.888* 


.735 


. 7I17 


.8^8* 


.705 


.927 


.975* 


.763 


.765 


.878* 


.798 


.798 


.88i»* 


.863 


.910 


.951* 


.710 


.8^9 


.908* 


.715 


.715 


. 850* 


.797 


.825 


.917- 


.779 


- 73^* 


.869* 


.839 


.850 


.910* 


.818* 


.821 


.828* 


.728 


.736 


.861* 


.772 


.798 


.911* 


.803 


.832 


.929* 


.831 


.870 


.910* 


.693 


.698 


• 79 1 ** 


.861 


.863 


.9*»5* 
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Table 10 

Number of Subjects Who Were Best Fit By Each of Three Regressions 
In Two Treatment Groups -- Experiment 1 



Treatment 

Strict Criteria 

Equation Group 

Word Group^ 

Llnlent Criteria 

Equation Group 

Word Group 



Regression 1 
1 

2 
1 

8 



Regression 2 

0 
13 



0 
9 



Regression 3 

20 
6 



20 



Note; - For Strict criteria, x - 20.88, df = 2, p < .001. 

2 

For llnlent criteria, x - 25.0*», df = 2, p < .001 
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Table 11 

Mean Response Time by Problem State for Two Treatment Groups 
on First and Last Trials — Experiment 1 



Treatment j 



Problem State 



10 11 21 21 1 22 22 1 31 32 32 1 33 51 52 53 54 

\ 

First Trial 

Equation Group 3.2 2.6 3-6 4.8 5-6 5-5 7-9 7.2 7-7 10.9 15.1 19.9. 18.6 35.6 

Word Group 5.0 5. 4 9-6 13.7 12.5 19-2 20.9 22.8 24.0 28.1 35-3 39.6 36. 1 40.9 
last Trial 

' EqliaTlon Group 2.0 1.9 2.5 2.8 3-5 4.7 3-7 4.4 4. 6 5-5 1 1 .6 13-2 12.8 21.2 

Story Group 2.3 2.9 4.7 8.9 10.0 9.1 11.8 14.2 15-7 17-6 22.8 25.3 26.2^25.9 



Note. - Main effect for treatment, p < .001; main effect for trial, p < .001; 

main effect for problem state, p < .001; Interaction between treatment 
and problem state, p < .001 ; Interaction between treatment, problem 
state and trial , n. s. 



-1 
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Table 12 

Mean Response Time Dy Problem State for Two Treatment Groups - Experiment 2 

Treatment . Problem State 

10 11 21 J* 1 -22 22 1 31 32 32 1 33 51 52 53 

Word-to-Solutlon . 6.4 9-1 12.8 15-3 14.3 17.1 23-0 21 . It 24.1 24.4 35.9 38.6 

Word'-to-Equatlon 12.9 16.2 18.5 15.5 17.1 24.5' 24.0 24.8 22.1 40.5 35.9 30.5 

1 

Note, - Main effect for treatment, p « n.s.; main effect for problem state, p < .001; 
Interaction between treatment and problem state, p < .001 • 
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Table 13 
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Values of R and Variable Weighings for Four Multiple Regressions fit to Two Treatment Groups 

f 

Experiment 2 



TreatmSnft , 

■ ; rf 

Word- to -Sol ut Ion 



One Variable 
(Step) t 

R 2 - .95 

Step » 5.1 sec 

Intercept » -.9 sec 



One Variable 
(Calc) 

R 2 - .97 

Computation « 8.1 sec 
Intercept » -«9 sec 



Two Variables 



R ■ .98 

Computation ■ 7.5 sec 
Move « I . ^ sec 
Intercept » -1 .6 sec 



Three Variables 



R « .98 - 

Computation ■ 6.9 sec 
Move 0 k 2 sec 
State * 1 ,8 sec 
Intercept a -.3 sec 



• Word-to-r-Equat Ion 



R « .66 

Step » 3.1 sec 

Intercept » 8. k sec 



R « .92 ■ ' 
Computation B 5.7 sec 
Intercept 0 6. 3 sec 



R - .97 

Computation = 6.6 sep 
Move o -2.2 sec 
Intercept « 7.** sec 



R 63 .97 

Computation ■ 6.7 sec 
Move a -2,2 sec 
Stage * 3 sec 
Intercept « 7.2 sec 



-1 
o 
cr 

n> 
3 



v. 



r o 
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Table I'l 

Transcription of Subject 3 ftor Problem 5'» 



Equat Ion 

(8 + 3X)/2 - 3X - U 
(8 + 3X)/2 + II - JX 

8 + 3X « 2(3X - 11) 
8, + 3X - 6X - 22 
-30 » -3X 
X = 10 



Subject's Conlments 



The first thing I'waht to so Is get all variables on one side. So, 
I was doing a lot of stuff. (Pause) First, I would add II to both 
sides. 

Then I would, lets see. Oh, first I would have to divide, I mean 
multiply both sides by 2. (Points to first equation.) I usually 
write It with the fraction different. OK, so I mean here (points to 
first equation) multiply by 2. 

Oh, I would multiply the whole thing at one time. 

In one step I would subtract 6X and add 8. Subtract 8. 

Then X = 10. 
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